Deriving General Association Rules from XML Data
نویسندگان
چکیده
XML documents have become poplar because the semi-structure nature of XML allows a wide variety of data to be represented in XML. Association rule mining is an important problem in the data mining domain. Currently, the problem of association rule mining on XML data has not been well studied. Existing work only addresses the problem of mining specific association rules from XML data. Such techniques specify antecedent and consequence to particular elements, and then mine rules with those specific antecedents and consequences. These techniques can not be used to mine general association rules. In this paper, we address the problem of deriving general association rules form XML data and propose an approach to perform the task. We implement our approach using Java DOM and test our algorithm on market basket data represented in XML
منابع مشابه
XML-Enabled Association Analysis
The discovery of association rules from large amounts of structured or semi-structured data is an important data mining problem [Agrawal et al. 1993, Agrawal and Srikant 1994, Miyahara et al. 2001, Termier et al. 2002, Braga et al. 2002, Cong et al. 2002, Braga et al. 2003, Xiao et al. 2003, Maruyama and Uehara 2000, Wang and Liu 2000]. It has crucial applications in decision support and market...
متن کاملA New Model for Discovering XML Association Rules from XML Documents
The inherent flexibilities of XML in both structure and semantics makes mining from XML data a complex task with more challenges compared to traditional association rule mining in relational databases. In this paper, we propose a new model for the effective extraction of generalized association rules form a XML document collection. We directly use frequent subtree mining techniques in the disco...
متن کاملDeriving Relation Keys from XML Keys
Much work on XML data was around storage and querying and did not consider constraints of XML, especially keys. Since constraints have been proposed in many papers for XML, much research work on constraints has been being done. In this paper, we consider an important class of constraints, XML keys, and try to find the relationship between XML keys and relation keys. Given XML data whose semanti...
متن کاملMining tree-based association rules from XML documents
The increasing amount of XML datasets available to casual users increases the necessity of investigating techniques to extract knowledge from these data. Data mining is widely applied in the database research area in order to extract frequent correlations of values from both structured and semistructured datasets. In this work we describe an approach to mine Tree-based association rules from XM...
متن کاملMining Association Rules from XML Data
The eXtensible Markup Language (XML) rapidly emerged as a standard for representing and exchanging information. The fastgrowing amount of available XML data sets a pressing need for languages and tools to manage collections of XML documents, as well as to mine interesting information out of them. Although the data mining community has not yet rushed into the use of XML, there have been some pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003